Complexity of stochastic branch and bound methods for belief tree search in Bayesian reinforcement learning

Abstract

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most problems of interest, the optimal solution involves planning in an infinitely large tree. However, it is possible to obtain stochastic lower and upper bounds on the value of each tree node. This enables us to use stochastic branch and bound algorithms to search the tree efficiently. This paper proposes two such algorithms and examines their complexity in this setting.
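
The idea of searching a tree with noisy upper and lower bounds can be illustrated with a minimal sketch. This is a generic stochastic branch and bound loop on a toy problem, not either of the two algorithms proposed in the paper, and it does not build a belief tree. Everything in it is an assumption made for illustration: the objective f, the constant TARGET, the interval-bisection tree, the Gaussian noise model, and all names. Each leaf keeps running means of its sampled bounds; the leaf with the highest mean upper bound is expanded, and the leaf with the highest mean lower bound is returned at the end.

```python
import random
from dataclasses import dataclass

TARGET = 0.3  # hypothetical maximiser of the toy objective


def f(x: float) -> float:
    """Toy objective to maximise on [0, 1] (assumption, not from the paper)."""
    return -(x - TARGET) ** 2


@dataclass(eq=False)
class Leaf:
    """A tree node covering the interval [lo, hi] with running bound estimates."""
    lo: float
    hi: float
    n: int = 0
    upper_sum: float = 0.0
    lower_sum: float = 0.0

    def observe(self, rng: random.Random, noise: float) -> None:
        """Draw one noisy sample of the upper and the lower bound."""
        best_x = min(max(TARGET, self.lo), self.hi)  # maximiser of f on [lo, hi]
        mid = 0.5 * (self.lo + self.hi)
        self.upper_sum += f(best_x) + rng.gauss(0.0, noise)  # noisy upper bound
        self.lower_sum += f(mid) + rng.gauss(0.0, noise)     # noisy lower bound
        self.n += 1

    @property
    def mean_upper(self) -> float:
        return self.upper_sum / self.n

    @property
    def mean_lower(self) -> float:
        return self.lower_sum / self.n


def stochastic_branch_and_bound(iterations: int, noise: float, seed: int = 0):
    """Expand the leaf with the highest mean upper bound, `iterations` times."""
    rng = random.Random(seed)
    leaves = [Leaf(0.0, 1.0)]
    for _ in range(iterations):
        for leaf in leaves:
            leaf.observe(rng, noise)  # refine every leaf's estimates
        i = max(range(len(leaves)), key=lambda k: leaves[k].mean_upper)
        best = leaves.pop(i)
        mid = 0.5 * (best.lo + best.hi)
        leaves += [Leaf(best.lo, mid), Leaf(mid, best.hi)]  # branch
    for leaf in leaves:
        leaf.observe(rng, noise)  # make sure new leaves have a sample
    return max(leaves, key=lambda l: l.mean_lower), leaves


if __name__ == "__main__":
    leaf, all_leaves = stochastic_branch_and_bound(iterations=50, noise=0.01)
    print(f"selected interval: [{leaf.lo}, {leaf.hi}], leaves: {len(all_leaves)}")
```

With noise set to zero the loop reduces to ordinary branch and bound and always bisects the interval containing TARGET. With noise, the running means are only estimates, so the search can spend expansions on suboptimal branches; how many such expansions occur is the kind of complexity question the abstract refers to.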

Bibliographic details

Author: Dimitrakakis, Christos
Year: 2009
Original format: PDF
Language: English

Similar literature

1. Learning Bayesian Belief Networks Based on the MDL Principle: An Efficient Algorithm Using the Branch and Bound Technique [J]. Joe Suzuki. IEICE Transactions on Information and Systems. 1999, No. 2
2. On the Complexity of Branch-and-Bound Search for Random Trees [J]. Luc Devroye, Carlos Zamora-Cura. Random Structures & Algorithms. 1999, No. 4
3. Monte-Carlo Tree Search for Bayesian Reinforcement Learning [J]. Ngo Anh Vien, Wolfgang Ertel, Viet-Hung Dang. Applied Intelligence. 2013, No. 2
4. Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning [C]. Christos Dimitrakakis. International Conference on Agents and Artificial Intelligence. 2010
5. Bayesian Methods for Knowledge Transfer and Policy Search in Reinforcement Learning [D]. Aaron Wilson. 2012
6. Enhancing Biomolecular Sampling with Reinforcement Learning: A Tree Search Molecular Dynamics Simulation Method [O]. Kento Shin, Duy Phuoc Tran, Kazuhiro Takemura. 2019
7. Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning [O]. Christos Dimitrakakis. 2009